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This article describes an approach to modeling knowledge acquisition in terms of walks along 
complex networks. Each subset of knowledge is represented as a node, and relations between such 
knowledge are expressed as edges. Two types of edges are considered, corresponding to free and 
conditional transitions. The latter case implies that a node can only be reached after visiting pre- 
viously a set of nodes (the required conditions). The process of knowledge acquisition can then be 
simulated by considering the number of nodes visited as a single agent moves along the network, 
starting from its lowest layer. It is shown that hierarchical networks, i.e. networks composed of 
successive interconnected layers, arise naturally as a consequence of compositions of the prerequisite 
relationships between the nodes. In order to avoid deadlocks, i.e. unreachable nodes, the subnetwork 
in each layer is assumed to be a connected component. Several configurations of such hierarchical 
knowledge networks are simulated and the performance of the moving agent quantified in terms of 
the percentage of visited nodes after each movement. The Barabasi- Albert and random models are 
considered for the layer and interconnecting subnetworks. Although all subnetworks in each realiza- 
tion have the same number of nodes, several interconnectivities, defined by the average node degree 
of the interconnection networks, have been considered. Two visiting strategies are investigated: 
random choice among the existing edges and preferential choice to so far untracked edges. A series 
of interesting results are obtained, including the identification of a series of plateaux of knowledge 
stagnation in the case of the preferential movements strategy in presence of conditional edges. 

PACS numbers: 89.75.Hc, 84.35.-K, 89.75.Fb 



Knowledge must come through action. (Sophocles) 

It is no good to try to stop knowledge from going for- 
ward. (Enrico Fermi) 

What must the world be like in order that man may 
know it? (Thomas Kuhn) 



I. INTRODUCTION 

Science is the art of building good models of nature, 
including science itself. This is the subject of the present 
article, i.e. to revisit the problem of modeling how knowl- 
edge is represented and acquired in the new light of com- 
plex network research. 

Modeling involves representations of the phenomenon 
of interest as well as the dynamics unfolding in such 
representations in a way which should be systemati- 
cally consistent with repetitive confrontation with exper- 
imental data. Because of their generality, complex net- 
works 0, 0, provide a natural and powerful resource 
for representing structures of knowledge, where facts are 
expressed as nodes and relations between facts are indi- 
cated by edges. Such an approach allows the process of 
knowledge acquisition to be modeled in terms of the num- 
ber of nodes (or edges) visited during walks through the 
knowledge network representation. The present work de- 
scribes a simple approach to knowledge acquisition based 
on complex networks and single- agent random walks. 



The plan of the article is as follows. After revising the 
main related works, focusing on knowledge representa- 
tion and random walks in scale free networks, each of the 
hypotheses adopted in our model are justified and dis- 
cussed. Among other issues, it is shown that hierarchical 
networks arise naturally as a consequence of composition 
of the prerequisites implied by the conditional links. Ab- 
sence of deadlocks (in the sense of node unreachability) in 
conditional transitions are avoided by providing that the 
networks at each layer correspond to a connected com- 
ponent^. e. any node in a layer can be reached through 
paths from any other nodes in that same network). Hi- 
erarchical complex networks (e.g. 0,0) include a series 
of layers, each containing a subnetwork, which are inter- 
connected through subnetworks. In the proposed model, 
conditional links are restricted to those connections be- 
tween successive layers. Two types of random walks are 
considered, involving random transitions as well as tran- 
sitions favoring new links. The simulations and respec- 
tively obtained results are presented and discussed next, 
followed by the development of an analyitical model of 
plateaux formation. The article concludes by presenting 
several perspectives for further developments. 



II. A BRIEF REVIEW OF RELATED 
CONCEPTS AND DEVELOPMENTS 

The subject of knowledge representation provides one 
of the main issues in artificial intelligence (e.g. 
Several discrete structures, including graphs and trees, 
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have been considered for the representation of knowledge. 
Of particular interest are semantic networks, which code 
each concept as a node and several relationships between 
such elements (e.g. proximity, precedence, relative po- 
sition, etc.) are encoded as edge labels. However, such 
structures are mainly considered as a reference for in- 
ferences during pattern analysis, not as a substrate over 
which to perform walks or explorations. The possibility 
to connect nodes through logical expressions associated 
to nodes has provided one of the main features of ran- 
dom Boolean networks 0, ^| . These expressions have 
been used mainly to combine local states of the nodes, 
not to control random walks. The possibility to asso- 
ciate control on the flow between nodes in graphs has 
been adopted in Petri nets (e.g. ^]]]), which has been 
used mainly for simulating computing and logical cir- 
cuits. The subject of random walks itself corresponds 
to a well-developed and important area in statistical me- 
chanics (e.g. [12 )• The analysis of random walks in scale- 
free networks has been addressed by Tadic in 0, Q 
regarding a special type of network aimed at simulating 
the Web, and by Bollt and Avraham and Noh and 
Rieger [16] considering recursive and hierarchical scale- 
free networks, the latter being concerned with the de- 
terministic type of hierarchical network proposed in ^3] • 
Random hierarchical networks similar to those consid- 
ered in the present work have been introduced in 0, [f| . 



III. HYPOTHESES 

Representability of Knowledge as a Network: The 
basic assumption underlying the present work is that 
knowledge can be represented as a complex network. 
First, it is understood that knowledge can be partitioned 
into chunks which are henceforth represented as network 
nodes, while relations between such subsets are repre- 
sented as edges. Two types of edge transitions are con- 
sidered in this work: free and conditional. In the first 
case, one is allowed to move freely from a node to the 
neighboring node, and to come back. The latter type of 
transition requires the moving agent to have passed first 
along a set of nodes which represent the condition for the 
movement. The process of learning can then be modeled 
in terms of the number of nodes (or edges) during walks 
proceeding along the respective knowledge network. 

Figure da) illustrates a free transition between two 
subsets a and b of knowledge, while the example in (b) 
expresses the simplest conditional case where the moving 
agent can go from b to a, but not from a to 6, unless it 
has already made at least one move from b to a. It is 
also possible to have hybrid situations such as those de- 
picted in Figure^c), where a can be reached from b or c, 
but only c can be reached from a. In order to allow the 
representation of multiple conditions (i.e. the fact that 
a can only be reached after visiting b±, b%, . . . , 6^, A be- 
ing a positive integer) we introduce the concept of token 
controlled network. This multiple conditional case is il- 



lustrated in Figure^d). Here, the subset of knowledge a 
can only be reached after visiting b and c (in any order). 
In other words, it is as if the agent would be collecting 
a token from each of the required nodes, which it keeps 
henceforth as keys allowing them to proceed through the 
respective conditional nodes. In the present work, it is 
assumed that all conditional nodes, i.e. those having to- 
kens required for movement to a node n, are connected 
to node n through directed edges. It is also possible to 
have alternative multiple conditions, as illustrated in Fig- 
ure^e), where the labels associated to the edges identify 
the respective conditional structures. In this case, a can 
be reached if and only if both b and c were visited before; 
or after visiting d and e. The case in which a node a 
can be accessed after visiting b or c is represented by two 
undirected edges, without associated labels, from those 
nodes to a. In brief, the free edges are represented by 
undirected edges, and the conditional by directed arrows 
with associated identifying labels. Alternative multiple 
conditions are not considered in the present work in or- 
der to limit the complexity and number of parameters in 
the experimental and analytical characterization of the 
dynamics of knowledge acquisition. 




FIG. 1: Types of relationships between knowledge subsets 
(i.e. nodes): equivalence (a); implication (b); hybrid rela- 
tionship involving equivalence and implication (c); multiple 
implication (d) and alternative implications (e). 

Regarding the movement of agents along such networks 
as they integrate the knowledge available from the nodes, 
it is natural that a free transition can be tracked in any 
direction. However, a conditional edge from b and c to 
a is considered to be direction restrictive only until a 
is reached for the first time (after visiting b and c), be- 
coming a free edge henceforth. This type of dynamics 
is implemented in order to express the fact that once 
knowledge about a, b and c is achieved, i.e. the condi- 
tional transition is mastered, it becomes possible to reach 
any of the conditions from node a. 

Hierarchical Knowledge Networks: The indiscriminate 
incorporation of the multiple conditions into a network 
can easily lead to deadlocks such as that illustrated in 
Figure [3 This deadlock is a direct consequence of the 
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fact that there is no path connecting i and j in the net- 
work represented in this figure. We henceforth assume 
that the knowledge network is consistent, in the sense 
that all nodes should be reachable. In addition, the sev- 
eral prerequisites between the portions of knowledge as- 
signed to nodes naturally define a hierarchy along the 
networks. For instance, some knowledge at node i may 
require previous visits to nodes j and fc, which we shall 
represent as i = P(j,k). The access to k and j may 
demand previous visits to nodes m and n and q and r, 
respectively, represented by the composition of prereq- 
uisites i — P{j,k) — P(P(m,n), P(q,r)), implying two 
hierarchies. Therefore, a network of consistent knowl- 
edge can be general and naturally organized as a hierar- 
chy of H layers, with the first layer corresponding to all 
nodes which have no prerequisites, while the remainder 
of the nodes are partitioned into layer by the composed 
prerequisites. Each layer h contains a connected subnet- 
work (i.e. any node in the subnetwork can be reached 
through at least one path from any node) which is inter- 
connected, via conditional edges, to nodes in layer ft, — 1. 
It is important to note that, although related to pre- 
vious works such as 0, , the hierarchical organization 
for knowledge representation is self-contained and follows 
naturally from knowledge consistence and the composi- 
tion of prerequisites between nodes. The total number of 
layers is henceforth expressed as H , the number of nodes 
in layer h as N(h), while the number of nodes in the 
whole hierarchical network is denoted as f2. 




FIG. 2: The indiscriminate use of multiple conditional im- 
plications quickly leads to deadlocks such as that illustrated 
here. The subset of knowledge in node k can never be reached 
by an agent starting at i or j, as there is no connection be- 
tween these two nodes. 

Figure [3] illustrates a simple hierarchical knowledge 
network containing three layers. For simplicity's sake, 
hybrid relationships or alternative implications are not 
considered henceforth. In addition, all network layers 
are assumed to be of the same type (e.g. random or 
Barabasi-Albert — BA) and have the same number of 
nodes and average node degree. The nodes at the high- 
est hierarchy are called assumptions, and are the place 
where all the walks start. Note that the highest hierar- 
chical levels are found at the lowest portion of Figure |3J 

The set of interconnecting networks is also of uniform 
type and have the same number of nodes and edges. In 




FIG. 3: A example of simple hierarchical network. 



the current work, these subnetworks can be of random 
(i.e. Erdos-Renyi) or BA types, defining how the sub- 
network in layer h + 1 connects to the nodes in layer h. 
Figure 0] illustrates how such interconnections are hence- 
forth understood. The layers h (Figure 0Ja)), and h + 1 
(Figure 0(c)) are to be connected through the intercon- 
nection subnetwork h in Figure Efb). Each edge (i,j) in 
the interconnection layer implies that node i in layer h 
is connected to node j in layer h — 1 and that node j in 
layer h is connected to node i in layer h — 1. Note that 
although a more flexible interconnecting scheme could be 
achieved by using directed interconnecting networks, the 
present study considers all layer and interconnecting net- 
works to be undirected because such a structure favors 
the analytical model developed in Section IVIII without 
loss of generality except for the respectively implied dou- 
bled average node degree. The connections implemented 
by the three subnetworks in Figure ^a-c) are illustrated 
in Figure ^d). 



IV. COMPUTATIONAL IMPLEMENTATION 

Knowledge networks involving the free and conditional 
edges described above can be conveniently represented 
in terms of an extended adjacency matrix 20] henceforth 
represented as K. Each node is labeled by consecutive 
integer values 1, 2, . . . , N. The equivalence between two 
nodes i and j is indicated by making K(i,j) = 1 and 
K(j, i) = 1. The single conditional connection from node 
i to j is represented as K(i,j) — 1 and K(j,i) = — 1. 
Note that such an assignment implements the adopted 
strategy that an implication edge can be backtraced un- 
conditionally. The multiple conditional transition from 
i\, 12, ■ ■ ■ , ia to j is represented as K(i p ,j) — 1 and 



4 



K(j,i p ) = —l,p = 1,2,..., A. Figure [U illustrates an 
extended adjacency matrix K considering BA models for 
layer and interconnecting networks. 

The moving agent keeps at all times a vector v of vis- 
ited nodes and an individual adjacency matrix G, which 
are continuously updated after each movement. The 
agent is granted to know about all feasible connections 
emanating from the current node i, while the feasibility 
of a given edge is decided by taking into account 

its list v of visited nodes. More specifically, an edge will 
be feasible, and accessible to the agent, in case it has 
already visited the required nodes and collected the re- 
spective tokens. 

The movement strategies described in the two follow- 
ing subsections have been considered in the reported sim- 
ulations. 



A. Random choice of edges 

In this case, the next edge to be taken from the current 
node i is drawn with the same probability between all 
the feasible connections between i and all other nodes. 
By feasible connection it is meant either a free edge or 
a conditional edge for which all conditions have already 
been met. 



B. Preferential choice of edges 

Unlike the previous case, the free edges which have not 
yet been tracked are considered first, with uniform proba- 
bility. In case no such edges exist, the next edge is drawn 
uniformly among the other allowed movements, i.e. free 
links which have already been tracked and enabled con- 
ditional links which remain untracked. The exclusion of 
the untracked conditional links, even if enabled, from the 
preferential movements is considered in order to express 
the fact that such a kind of knowledge enlargement is 
more demanding than exploring first the untracked un- 
conditional connections. 

Note that in neither case the agent uses its knowledge 
about the status (i.e. already visited or not) of the node 
to which the emanating edges lead to. Although more 
sophisticated moving strategies which make full use of 
the information stored in partial graph G stored with 
the agent can be devised, including the choice of shortest 
paths to unvisited edges, they are not pursued further in 
the present work. 

The two strategies above aim to represent, though very 
naively and incompletely, two possible ways to acquire 
knowledge. In the first case, no distinction is made be- 
tween a new or already taken relation. It is as if the 
researcher (i.e. the agent walking through the network) 
is equally interested in revising a relationship or seek- 
ing for new possible connections. In the second visiting 
scheme, the researcher is more actively interested in ex- 
ploring new relationships, resorting to already tracked 



connections or enabled conditional links only in case no 
untracked free links are available. Intuitively the second 
strategy would seem to be more effective in finding new 
knowledge, by covering the edges more effectively. 

V. SIMULATIONS 

For simplicity's sake, all simulations reported in this 
work are restricted to hierarchical networks with H = 5 
network layers, with all layer and interconnecting sub- 
networks having N(h) = N = 20 nodes, implying a to- 
tal of Q = 100 nodes for all layer subnetworks (larger 
networks involve much longer execution times). Random 
and Barabasi- Albert models are considered for layers and 
interconnections. The latter are defined by the number 
of edges m of each new added node, starting with mO 
nodes. For each such BA network, an 'equivalent' ran- 
dom network - in the sense of having the same average 
degree (k) and number of edges Ne - is obtained. Since 
the average degree of a BA network with m edges per 
node is known to be 

(k) = 2m, (1) 

the Poisson rate of the equivalent random (Erdos- 
Renyi) network with the same number of N nodes and 
same average degree (k) can be verified to be given as 

7e = 2m/(N _ i). ( 2 ) 

The above result follows from the fact that in an Erdos- 
Renyi network we have 7 = (k) /{N — 1). The number 
of edges in any of the BA or random networks can be 
calculated as 

N E = N/2 (k) . (3) 

Therefore, in this work the values of m are used to 
define the connectivity of the BA models and then of the 
respective random counterparts. 

Three configurations have been chosen for the BA 
layer models: m = 1,5 and 10, while eight configura- 
tions are considered for the interconnecting networks: 
m = 1, 2, . . . , 8. Table [I] shows the expected values of 
number of edges Ne, average degree (k), and Poisson 
rate 7 e for values of m ranging from 1 to 10. 

The following configurations were addressed in the re- 
ported simulations: 

(i) all layers and interconnecting subnetworks are BA; 

(ii) all layers subnetworks are random 
and all interconnecting networks are BA; 

(iii) all layers subnetworks are BA and all inter- 
connecting subnetworks are random; 

(iv) all layers and interconnecting subnetworks are 
random. 
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m 


N E 


(k) 


7e 


1 


20 


2 


0.105 


2 


40 


4 


0.210 


3 


60 


6 


0.316 


4 


80 


8 


0.421 


5 


100 


10 


0.526 


6 


120 


12 


0.631 


7 


140 


14 


0.737 


8 


160 


16 


0.842 


9 


180 


18 


0.947 


10 


200 


20 


1.05 



TABLE I: Values of m for the BA models and the respective 
total number of edges Ne, average degree (k) and equiva- 
lent Poisson rate 7 e expected for each subnetwork (layer or 
interconnecting) with N = 20 nodes. 



Each of the above configurations was investigated while 
considering two visiting strategies: (a) allowed edges are 
chosen randomly; and (b) if available, untracked allowed 
edges are selected randomly, otherwise allowed tracked 
edges are selected randomly. In order to assess the effect 
of the conditional edges between successive layers, coun- 
terparts of each considered configuration interconnected 
by unconditional networks have also been simulated and 
had their performance quantified. Although several al- 
ternative or complementary performance indices could 
have been considered, for simplicity's sake our attention 
is restricted to the percentage P of visited nodes and per- 
centage E of visited edges at time instant t. The speed 
of knowledge acquisition can be estimated by taking the 
time derivative of this quantity, i.e. P. A total of 100 re- 
alizations involving N t = 2400 time steps (corresponding 
to each movement along the walk) have been performed. 

Figure H3 shows the learning curves obtained for P(i) 
considering layer networks defined by m = 5 and five 
interconnecting layers with m = 1, 2, . . . , 8 for the sev- 
eral combinations of layer /interconnection types of net- 
works, presence of conditional connections between lay- 
ers, and consideration of the random choice of move- 
ment. The title of each graph is henceforth organized 
as (layer, interconn., movement, conditionality), where 
layer and inter conn, indicates the model assumed for the 
layer and interconnecting networks, movement identifies 
the moving agent strategy (random or preferential for 
new edges), and conditionality indicates the type of in- 
terconnecting edges (conditional or free/unconditional). 

Analogous results obtained for the preferential move- 
ments / conditional connections; random movements / 
unconditional connections and preferential movements / 
unconditional connections are given in Figures El El 
and |51 respectively. In all the remaining figures in this ar- 
ticle, the legend bar indicates the density of the intercon- 



nections. The values in these legends correspond to the 
parameter m adopted for the BA model, therefore defin- 
ing the density of interconnections for this model and also 
for the equivalent random counterpart (see Equation yj. 

Models considering different connectivities for the layer 
networks, namely m = 1 and m — 10, have also been sim- 
ulated and investigated. The percentage of visited nodes 
P(t) obtained for the preferential choice / conditional in- 
terconnections situation is shown in Figures E3 and 1111 
respectively. 

The percentages of visited edges E(t) at each time in- 
stant are given in Figures IT21 (random movements, condi- 
tional interconnections) , 1 131 (random movements, uncon- 
ditional interconnections), 1141 (preferential movements, 
conditional interconnections) and 1151 (preferential move- 
ments, unconditional interconnections). 

VI. DISCUSSION 

The results presented in the previous section are dis- 
cussed in the following with respect to the two main per- 
formance situations considered in this work: number of 
visited nodes and edges. 

A. Knowledge in terms of Visited Nodes 

Effects of conditional interconnections Compared to 
unconditional interconnections, the case of conditional 
interconnections tends to substantially reduce the knowl- 
edge acquisition speed. This was expected indeed, be- 
cause the conditional interconnections imply the moving 
agent to stay longer wandering at previous layer networks 
in order to collect the tokens necessary to proceed into 
new layers. 

Effects of the network models As can be easily inferred 
by comparing the left and right columns of figures 0- 
13 interconnections through BA subnetworks have about 
the same effect as random networks on the knowledge 
acquisition in all cases. This is mainly a consequence of 
the imposed similar connectivity of the BA and random 
counterpart models used for the interconnecting subnet- 
works. Similarly, the use of BA or random models for the 
layer networks also led to minimal effect on the knowl- 
edge acquisition dynamics. In brief, the type of network 
model, BA or random, had little effect on the overall 
knowledge acquisition efficiency. 

Effects of the density of interconnections Denser inter- 
connecting subnetworks tend to decrease the knowledge 
acquisition speed in the case of conditional interconnec- 
tions, having little effect for unconditional interconnec- 
tions (i.e. the learning curves are nearly identical what- 
ever the interconnecting density in Figures (7| and |5J. 
Such a behavior is explained because a larger number of 
conditional interconnections implies the moving agent to 
collect more tokens in the previous layers before proceed- 
ing to further layers. 
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Presence of Plateaux The preferential movement strat- 
egy defined for conditional interconnections has implied 
a series of plateaux of knowledge acquisition along the 
learning curves. The learning curves in Figure|S]are char- 
acterized by being preceeded by a quick acquisition stage, 
followed by the respective plateau, whose width tends to 
become larger as time goes by. These plateaux indicate 
a phase of knowledge stagnation, corresponding to the 
state of dynamics of the system where the walks proceed 
predominantly over edges in the previous layers, while 
the conditional links leading to the subsequent layers are 
not yet feasible. With this respect, it is possible to draw 
a naive analogy with a particle moving along a series of 
chambers limited by successive compartments which are 
progressively removed. Congruently, the plateaux tend 
to become larger along time because the walks have each 
time more alternatives of random movement among the 
feasible edges. This possibility is corroborated by the 
fact that the plateaux become more discernible for large 
interconnectivities (i.e. large values of m adopted for the 
interconnections in B A and random counterparts) , which 
imply more edges between subsequent layers. The (pos- 
sibly counterintuitive) tendency of the preferential move- 
ments to reduce the knowledge acquisition rate when 
compared to the random strategy can be explained by 
the fact that in the preferential case the agent is forced to 
waste time going through untracked edges in both layer 
and interconnecting networks even cases where most ad- 
jacent nodes have already been visited. 

An important issue related to the characterization of 
such plateaux is whether the several individual trajecto- 
ries obtained during simulation are well represented in 
terms of their respective average values. Figure ll^f a) 
shows 100 such different trajectories obtained for 5 lay- 
ers with N = 20 and m = 10, interconnected through 
networks with m = 8. The average and standard devia- 
tions of these trajectories are shown in Figure It is 
clear from these figures that the trajectories do present 
small dispersion, being therefore properly represented in 
terms of the respective average. Note also the first short 
plateau with height 0.2, which was not visible in the pre- 
vious figures because of the smaller resolution of those 
pictures. 

Additional insight about the evolution of the ratio of 
visited nodes in the presence of conditional edges can be 
obtained by considering the number of the layer visited 
by the agent along the random walk steps. Such a curve 
is illustrated in Figure IT7I for specific realizations consid- 
ering random (a) and preferential (b) agent movements. 
It is clear from this figure that the preferential random 
walk implies the agent to explore most of the nodes in the 
current layer, while seeking for free edges, before proceed- 
ing to explore the subsequent layers, therefore implying 
the formation of plateaux. 

Layer Networks with other Connectivities In order to 
investigate the effect of the connectivity of the layer net- 
works on the overall knowledge acquisition dynamics, the 
above simulations were performed also for m = 1 and 10 



(all other situations discussed in this subsection refer to 
layer networks with m = 5). It is clear from Figures ITUI 
and II II that the larger number of edges in each layer im- 
plied by m = 10 tends to substantially slow down the 
node coverage and to yield more marked plateaux. 

The overall fastest knowledge acquisition was observed 
for the cases involving free transitions, with some speed 
up verified for the preferential movement strategy (i.e. 
Figure EJl. 



B. Knowledge in terms of Visited Edges 

The dynamics of knowledge acquisition can also be 
quantified in terms of the percentage E(t) of visited edges, 
which provides additional insights about the considered 
models and strategies. These results are shown in Fig- 
ures^ltoEl The curves obtained for random movements 
(i.e. Figures IT21 and IT5|) are quite similar, indicating that 
the presence of conditional edges has little effect On the 
edge coverage under the random movement strategy. The 
results obtained for preferential movements and condi- 
tional transitions (i.e. Figure [SJ indicate that the edges 
are covered less effectivey than in the two previous cases, 
especially for denser interconnections. The fastest cover- 
age of edges was clearly obtained for preferential move- 
ments with free transitions (i.e. I15fl . which is a direct 
consequence of the preference for new edges imposed by 
that strategy. Such a fast edge coverage is also accom- 
panied by the fastest node coverage in Figure [5] Also 
interesting is the fact that, though the edge coverage ob- 
tained for random movements (i.e. Figures El an d I13|) 
resulted quite similar, the node coverage was verified to 
be much faster in the former situation than in the lat- 
ter. Actually, the case involving random movements and 
conditional transitions is characterized by the fact that 
most nodes are covered after approximately 1500 basic 
time steps (see Figure [SJ even though only a fraction of 
respective edges have been covered at that time, as indi- 
cated by Figure IT2"1 

In order to better understand the preferential dynam- 
ics, let us first consider its initial stages, where the agent 
starts its exploration of the first and second layers. Be- 
cause preference is given to free untracked edges, the 
agent tends to remain in layer 1 until most of its free 
edges are tracked. At this point, not only few free 
edges remain untracked, but also most conditional links 
have been enabled. Therefore, in presence of few un- 
tracked free edges, the agent considers more frequently 
movements going through tracked or enabled conditional 
edges, the latter leading to higher hierarchies. At subse- 
quent stages, when the agent is exploring a higher hierar- 
chy, it will tend to go through the free untracked edges, 
which now include the conditional edges leading back to 
previous layers, proceeding into higher layers only when 
the edges within and between the previous layers have 
been mostly covered, so that few preferential movements 
are allowed and the agent now considers more frequently 
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going through already tracked links on the current or pre- 
vious layers or enabled conditional links leading to higher 
layers. 

The lack of plateaux in traditional random walks in 
presence of conditional links can now more easily under- 
stood as being a consequence of the fact that, by treating 
free or enabled conditional edges with the same priority, 
irrespectively of being already traked or not, allows more 
frequent explorations of the enabled conditional edges 
leading to higher hierarchies. Note that even at the ear- 
liest stages of the exploration depicted in Figure ??(a), 
the agent manages to get as far as the last layer. In 
this way, plateaux of stagnation are completely avoided. 
However, it remains an interesting fact that the barriers 
of conditional links are overcomed with relative ease by 
the moving agent. 



VII. ANALYTICAL MODEL 



beginning of the random walk, the exploration is limited 
to layer h = 1, so that P c is reached at a critical step 
t c so that P(t c ) — P c which can be calculated through 
Equation 0] 

Afterwards, all conditional links at layer 2 are enabled, 
so that the exploration of that layer begins. However, as 
the interconnecting edges are bidirectional, the agent will 
now exchange between layers 1 and 2 until the ratio Pc 
is achieved for layer 2. The respective occupancy of these 
two layers can be approximated in terms of the Markov 
chain shown in Figure ITS1 

The stochastic matrix S associated to this Markovian 
system is immediately obtained as 



So 



(fc) 

W+2(fe i > 



2{fc i ) 
(fc)+2(fc,) 



(k)+2(ki) (k)+2(ki) 



(5) 



The several interesting dynamical features so far iden- 
tified through numerical simulations are investigated fur- 
ther, especially regarding their behavior under scaling of 
the network sizes and number of layers, in the present 
section through a simplified analytical model. Though 
we limit this investigation to preferential random walks 
in the networks, more specifically the case leading to 
plateaux, the other situations considered in this article 
can be treated similarly. 

We start by considering the fact that, at step t of a 
random walk preferential to untracked edges, the ratio 
of visited nodes P(t) of a random or BA network with 
N nodes and free links has been been verified, through 
extensive simulations, to be approximated (at least for 
10 < N 1000) as 



where the factor 2 stands for the fact that one undi- 
rected edge in the interconnecting layer implies two con- 
ditional links between layer 1 and 2 (see Figure 0J. 

Although the moving agent will soon be spending the 
same proportion of time at layers 1 or 2, it is the rela- 
tive frequency F of time steps at which the moving agent 
remains at or enters into layer 2 that matters for the cov- 
erage of the nodes in this layer and respective acquisition 
of prerequisites. This frequency is immediately given as 
being equal to the relative number of movements through 
the two edges leading to node 2. Tt follows by symmetry 
that F = 0.5, implying the exploration of layer 2 to be 
effectively performed at a renormalized P(t) given as 



Pit) = 1 



-t/N 



(4) 



P(t) = 1 - e-°- 5t / N 



(6) 



We shall make a small modification of the way in which 
the conditional edges are considered so as to simplify the 
analytical characterization of the knowledge acquisition 
dynamics. More specifically, starting at layer h = 1, 
movements to the subsequent layer h + 1 — 2 will only 
be allowed after a ratio P c of visited nodes in h has been 
achieved. The difference between such an assumption 
and the situations so far considered in the present article 
is that in the latter situation the moving agent is al- 
lowed to explore subsequent layers at any time, provided 
it holds the respective prerequisites. However, the above 
simplification holds particularly well when the connec- 
tivity between subsequent layers is relatively large with 
respect to the connectivity between the nodes in each 
layer, because in such a situation most movements in the 
subsequent layer will be mostly blocked by the prerequi- 
sites and preference to free untracked edges in the current 
and previous layers. 

Let (fc) be the average degree at each layer and (hi) be 
the average degree of the interconnecting layer. At the 



Note that t is relative to each exploration stage, al- 
ways starting when the critical ratio of visited nodes is 
achieved for the last current layer. After liberating layer 
3 for exploration, the Markov model becomes as shown 
in Figure lTHT b). and the respective stochastic matrix now 
reads 



S,= 



(fc) 

<fc>+2(fe i > 

2{fc.) 
<fc)+2<fc<> 




2{fc < ) 
(fc)+4(fc.,) 

w 

<fe)+4<fe.,) 
gjfcj) 





gjfej) 

<fc)+2(fe l 



(fe)+4(/ci> (k)+2(ki) 



(7) 



Note that all subsequent stochastic matrices will share 
the right-hand lower 3x2 block with the above matrix, 
from which a generic probabilistic model can be devel- 
oped. At a generic hierarchical level h, these four ele- 
ments are given as follows 
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Sh-iji-i = 

Sh-l.h = 



2{h 



{k)+A(h) 
(k) 

(k)+4(h) 

2(h) 
(k) + 2{ki) 

Sh,h-1 = Sh-2,h-l 
and 

(A) 



Sh.h 



(k) + 2(h) 



(8) 

(9) 

(10) 
(11) 

(12) 



Because of the inherent symmetry of the transition 
probabilities in the matrix S, the occupancy p(h) of each 
state h can be calculated as 



p(h) 



2+(h-2)(l+S(h-l,h)) 



l+S(h-l,h) 
2+(h-2)(l+S(h-l,h)) 



for h = 1 or h = H 



otherwise 



As the relative frequency in which each of these tran- 
sitions S(a, b) are performed is immediately given as 
p(b)S(a, b), we have that the relative frequency of move- 
ments F(h) into the last currently enabled layer h is 
therefore given as 



F(h) 



(14) 



/?+(/* -2) 7 
where 

a = p(h - l)Sh,h-i +p{h)S hih 
p = 2(p(h)S h - 1 , h +p(h)S h , h ) 
7 = 2p(h ~ l)Sh-2,h-i + p(h - l)Sh-i t h-i 

By substituting Equations 181-1121 and ED hito Equa- 
tion O it follows that 



Equation 1161 in terms of the value F(h). Consequently, 
the length of each stage along the liberation of the layers 
will be given as 



L(h) = l/F{h) 



B + hC 
A 



(17) 



where A, B and C are constants defined by (k) and 
(hi). It is now clear that this length scales proportion- 
ally to h. Equation 1161 also provides the means for an- 
alyzing the scaling of P(t) with N. Because the coef- 
ficient c of the exponential in P(t), i.e. c = F(h)/N, 
corresponds to a product between F(h) and 1/N, the 
scaling of the subnetworks size from N to aN will im- 
ply c = (F(h)/a)N = c/a for all h, i.e. the length of 
all stages L(h) will be equal to aL(h). In other words, 
the overall shape of the ratio of visited nodes will not 
change when N is scaled while all other parameters are 
kept fixed. 

FigurelT9"lillustrates the evolution of the ratio of visited 
nodes in a BA network as estimated by the above model 
assuming TV = 20, (k) = 10, (h) = 16, P c = 0.9999, and 
five layers. Except for the value P c , these parameters 
correspond precisely to those considered in the evolution 
shown in Figure lTBl The specific value of P c was chosen so 
as to obtain proper fitting between the experimental data 
and the theoretical model. A good overall adherence can 
be observed between the analytical and respective experi- 
mental evolutions regarding both the lengths and heights 
of each plateau. Interestingly, the analytical model also 
captures the fact that progressively smoother transitions 
are obtained at higher hierarchies. The main difference 
between these evolutions are related to the the fact that, 
in the original experiment, the moving agent was allowed 
to proceed to subsequent layers more freely, i.e. before 
the critical ratio of visited nodes P c had been reached. 
Such a dynamics would contribute to smoothing the lenf- 
hand side of the evolution curve at each transition, as is 
the case in the experimental results in Figure [TBI 



F ^ - BThC ^ 
where 

A=(k) 2 +6(k) (fci) + 8(fci) a 

and 

B = -4(h)(k)-l6{hf 
C = (kf + 8 (h) (k) + 16 {hf 

The evolution of the ratio of visited nodes can therefore 
be estimated by using Equations |H (for h = 1) andlPol (for 
h>2). 

P(t) = 1 - e - F W*/JV (16) 

This implies the overall evolution to be composed by 
subsequent time scaled versions of Equation 0] given by 



VIII. CONCLUDING REMARKS 

This article has presented a simple approach to knowl- 
edge acquisition based on representation of knowledge 
as a hierarchical complex network Q and the model- 
ing of the process of knowledge acquisition in terms of 
walks along such networks. Though simple, the consid- 
ered models incorporate the existence of two types of 
edges (free and conditional), including multiple condi- 
tional transitions where the access to specific nodes are 
granted only after the agent has visited specific nodes. 
This movement strategy represents a possibly new mech- 
anism for complex network and random walk researches. 

Two visiting strategies have been considered: at ran- 
dom and preferential to still untracked free edges. Sim- 
ulations considering several densities of connectivity be- 
tween 5 hierarchical layers have been evaluated with re- 
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spect to conditional interconnecting networks and un- 
conditional counterparts, and the knowledge acquisition 
dynamics quantified in terms of the number of visited 
nodes and edges as a function of time (i.e. each basic 
movement of the agent). A Markovian analytical model 
of the learning dynamics is developed for the case of pref- 
erential random walks in presence of conditional links, 
which reproduces the plateaux heights and lengths. This 
model has allowed the discussion of the scaling proper- 
ties of the dynamics with respect to the network size and 
number of layers. Among other findings, the lengths of 
the plateaux have been verified to be proportional to the 
number of already explored layers. 

Despite the simplicity of the approach, a series of inter- 
esting complex dynamics and effects have been identified 
from the learning curves, including the fact that the pref- 
erential movement strategy was slower than the random 
counterpart for the case of conditional interconnections, 
as well as the identification of plateaux of stagnation of 
learning for the latter situation. 

The reported work has paved the way to several future 
works, including the consideration of multiple agents |l8j , 
which may or may not share information about their in- 
dividual adjacency matrices. Another relevant issue to 



be incorporated into the model is the fact that the tran- 
sitions from one node to another, i.e. the inference of 
some subset of knowledge from another, may not always 
take the same time. It would therefore be interesting 
to consider diverse distributions of time-weights along 
the hierarchical knowledge networks. Also interesting is 
the fact that the suggested approach and models provide 
an interesting framework for investigating data flow ar- 
chitectures (e.g. ^t|). This type of parallel computing 
architecture is characterized by a hierarchical processing 
flow constrained by dependences between intermediate 
computing stages, which could be conveniently modeled 
by the hierarchical complex networks with multiple con- 
ditional edges. It would be interesting to consider addi- 
tional measurements typically used in random walk in- 
vestigations, such as return time and correlations. 
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(a) 



Layer h 




FIG. 4: Two layers (a,c) and one interconnecting (b) sim- 
ple subnetworks and the section of the hierarchical network 
respectively implemented (d). 



FIG. 5: Example of extended adjacency matrix K considering 
BA layers and random interconnections. The conditional con- 
nections are represented in white and the equivalence edges 
in gray. 
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FIG. 6: The percentage of visited nodes P in terms of the time t for the configuration involving random choice of edges and 
conditional interconnections. The legend indicates the density of interconnections in terms of m. 
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FIG. 7: The percentage of visited nodes P in terms of the ti 
unconditional interconnections. 
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FIG. 8: The percentage of visited nodes P in terms of the time t for the configuration involving preferential choice of edges 
and conditional interconnections. 
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(BA, rand. , pref. , free) 
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FIG. 9: The percentage of visited nodes P in terms of the time t for the configuration involving preferential choice of edges 
and unconditional interconnections. 




FIG. 10: The percentage of visited nodes P in terms of the time t for the configuration involving preferential choice of edges 
and conditional interconnections. All layer networks consider m — 1 
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FIG. 11: The percentage of visited nodes P in terms of the time t for the configuration involving preferential choice of edges 
and conditional interconnections. All layer networks consider m = 10 




FIG. 12: The percentage of visited edges E in terms of the time t for the configuration involving random choice of edges and 
conditional interconnections. 
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FIG. 13: The percentage of visited edges E in terms of the time t for the configuration involving random choice of edges and 
unconditional interconnections. 




FIG. 14: The percentage of visited edges E in terms of the time t for the configuration involving preferential choice of edges 
and conditional interconnections. 
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FIG. 15: The percentage of visited edges E in terms of the time t for the configuration involving preferential choice of edges 
and unconditional interconnections. 
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FIG. 16: Visualization of 100 different trajectories obtained for 5 layers with N = 20 and m = 10, interconnected through 
networks with m = 8 (a), and respective average and standard deviations (b). 




FIG. 17: The number of layer occupied by the moving agent in terms of the random walk steps t for specific realizations 
considering random (a) and preferential (b) movements. 
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FIG. 18: The movement of the agent between subsequent hierarchical layers can be modeled in terms of Markovian models, as 
illustrated for two (a) and three (b) layers. Note that the progress to the subsequent layers are blocked until the critical ratio 
of visited nodes P c is achieved for the last enabled layer. 
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FIG. 19: Analytical evolution of the ratio of visited nodes for a random network with N = 20, (k) = 10, = 16, P c = 0.9999, 
and five layers. 



